In this paper, we propose a noise-robust bottleneck feature representation which is generated by an adversarial network (AN). The AN includes two cascade-connected networks: an encoding network (EN) and a discriminative network (DN). Mel-frequency cepstral coefficients (MFCCs) of clean and noisy speech are used as input to the EN, and the output of the EN is used as the noise-robust feature. The EN and DN are trained in turn: when training the DN, noise types are used as the training labels, and when training the EN, all labels are set to the same clean-speech label, which aims to make the AN features invariant to noise and thus achieve noise robustness. We evaluate the performance of the proposed feature on a Gaussian Mixture Model-Universal Background Model based speaker verification system, and compare it with MFCC features of speech enhanced by short-time spectral amplitude minimum mean square error (STSA-MMSE) and deep neural network-based speech enhancement (DNN-SE) methods. Experimental results on the RSR2015 database show that the proposed AN bottleneck feature (AN-BN) dramatically outperforms the STSA-MMSE and DNN-SE based MFCCs for different noise types and signal-to-noise ratios. Furthermore, the AN-BN feature is able to improve speaker verification performance under the clean condition.
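The alternating label scheme described above can be sketched as follows. This is a minimal illustration, not the paper's implementation: it assumes noise types are encoded as integers and reserves the label 0 for clean speech (both assumptions; the abstract does not fix an encoding).

```python
import numpy as np

# Assumed encoding: integer noise-type labels, with 0 reserved for clean speech.
CLEAN_LABEL = 0

def training_labels(noise_types, phase):
    """Return per-utterance targets for one training phase of the AN.

    phase == "DN": the discriminative network is trained to predict the
    true noise type of each utterance.
    phase == "EN": every target is replaced by the clean-speech label, so
    the encoding network is pushed to produce noise-invariant features
    that the DN cannot distinguish from clean speech.
    """
    noise_types = np.asarray(noise_types)
    if phase == "DN":
        return noise_types
    if phase == "EN":
        return np.full_like(noise_types, CLEAN_LABEL)
    raise ValueError(f"unknown phase: {phase}")
```

For a mini-batch with noise types `[0, 2, 1, 3]`, the DN phase keeps the labels as-is, while the EN phase maps them all to `[0, 0, 0, 0]`; alternating the two objectives is what drives the bottleneck representation toward noise invariance.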